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SECTION  I 


INTRODUCTION 

I. I  BACKGROUND  AND  PURPOSE 

The  general  mathematical  form  of  many  software  resource  models 
is  reducible  to  the  expression 

3 

DE  =  al  II f  j  (j  «  1,  n)  and  a,  0  >  0  (1) 

where  DE  is  the  development  effort  measured  in  staff  months,  a  and  3 
are  constants  specific  to  the  model  type,  I  denotes  the  size  of  the 
Computer  Software  Configuration  Item  (CSCI)  measured  in  thousands  of 
delivered  source  instructions  (KDSI),  and  IXf j  is  the  product  of  n 
development  effort  multipliers  which  depict  unique  attributes  of  the 
software  product,  host  computer  hardware,  programmer  and  analyst 
experience,  and  product  schedule.  If  the  size  I  of  a  CSCI  is  a 
random  variable  with  a  given  probability  distribution,  then  DE  is  a 
random  variable ;  and  closed  form  expressions  for  the  probability 
distribution  of  DE  and  the  mean  and  variance  of  DE  may  be  derived 
for  selected  probability  distributions. 

The  purpose  of  this  paper  is  to  provide  the  closed  form 
equations  for  the  density  and  distribution  of  DE  and  the  mean  and 
variance  of  DE  for  a  set  of  prescribed  size  probability  distribu¬ 
tions.  These  equations  are  needed  for  determining  analytically  the 
effort  uncertainty  due  to  the  size  uncertainty  associated  with  the 
development  of  the  Computer  Software  Configuration  Item. 

This  analysis  does  not  preclude  the  fact  that  there  exist  other 
technical  and  programmatic  uncertainties  which  impact  the  develop¬ 
ment  of  software.  Multiple  site  development,  subcontracting, 
personnel  experience,  concurrent  hardware  development,  and  the 
availability  of  a  sound  programming  support  environment  are 


additional  sources  of  uncertainty  which  should  be  quantified  when  a 
formal  software  risk  analysis  is  performed. 

1.2  SCOPE  OF  THE  PAPER 

Two  classes  of  size  probability  distribution  functions  will 
be  defined  for  this  analysis.  They  are  the: 

o  Uniform  Distribution 

o  Triangular  Distribution 

Within  their  permissable  intervals  of  definition,  the  densities  of 
these  size  probability  distribution  functions  are  characterized  by 
the  common  feature  of  unimodality  and  finite  range.  In  general, 
these  densities  are  typical  of  the  shapes  encountered  when  size 
assessments  for  a  CSCI  are  made. 

For  each  size  probability  distribution  function  described 
above,  Section  2  provides  equations  which  define  the  exact 
development  effort  probability  distribution  and  the  exact  effort 
mean  and  variance  specific  to  that  CSCI.  Section  3  presents  a 
graphical  representation  of  the  effort  probability  distributions 
which  arise  when  the  size  of  a  CSCI  is  transformed  to  the  unit 
interval.  This  is  useful  for  determining  the  effort  associated  with 
a  specified  level  of  risk.  Section  4  introduces  the  Analytic 
Software  Effort  Probability  (ASEP)  model.  The  ASEP  model  is  an 
extension  of  the  equations  developed  in  Section  2  for  the  purpose  of 
approximating  the  development  effort  mean,  variance,  and 
distribution  for  a  system  consisting  of  multiple  CSCIs.  Appendix  A 
provides  case  studies  which  describe  how  the  ASEP  model  is  applied 
to  single  and  multiple  CSCI  systems. 


SECTION  2 


MEASURING  EFFORT  UNCERTAINTY  IN  a  1^  SOFTWARE 
RESOURCE  MODELS  -  AN  ANALYTIC  APPROACH 

2.1  INTRODUCTION 

There  are  a  number  of  software  resource  models  which  are 
reducible  to  the  form  shown  in  equation  l.  Examples  of  such  models 
are  the  Constructive  Cost  Model  (COCOMO)  [1],  the  Jensen  Model  [2], 
the  Doty  Model  [5],  and  the  Walston-Felix  Model  [5],  It  is  not 
within  the  scope  of  this  paper  to  discuss  the  details  of  these 
software  resource  models.  The  reader  is  directed  to  the  references 
if  such  information  is  desired.  The  mathematical  analysis  presented 
in  this  paper  is  focused  on  determining  analytical  expressions  for 
the  probability  distribution  of  DE  and  the  mean  and  variance  of  DE, 
given  that  the  size  I  of  a  CSCI  is  a  random  variable  from  one  of  the 
classes  of  size  probability  distributions  described  in  Section  2.2. 
The  analysis  methodology  is  general  to  any  software  resource  model 
of  the  form 


DE  =  aie  a,P  >  0  (1) 

0!  =  aJIfj  (j  *  i ,  ■  •  •  j  n) 

where  DE  is  the  development  effort  measured  in  staff  months,  I 
denotes  the  size  of  the  CSCI  measured  in  thousands  of  delivered 
source  instructions,  a  and  g  are  constants  specific  to  the  model 
type,  and  Ilf j  is  the  product  of  n  development  effort  multipliers 
which  describe  unique  characteristics  of  the  software  product,  host 
computer  hardware,  programmer  and  analyst  experience,  and  product 
schedule . 
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2.2  SELECTED  CLASSES  OF  SIZE  PROBABILITY  DISTRIBUTIONS 


Given  that  the  size  I  of  a  CSCI  is  a  random  variable,  two 
classes  of  size  probability  distribution  functions  are  discussed  in 
this  section.  They  are  the: 

o  Uniform  Distribution 
o  Triangular  Distribution 

Other  probability  distribution  functions  such  as  the  beta 
distribution  may  be  used  to  define  the  size  uncertainty  of  a  CSCI, 
however,  the  mathemat ical  analysis  in  this  paper  quickly  becomes 
unwieldy  if  size  is  beta  distributed.  The  uniform  and  triangular 
distributions  may  be  considered  robust  in  the  sense  that  they 
describe  reasonably  well  a  wide  variety  of  size  distributions,  even 
though  they  may  not  always  be  the  most  precise  representation  of  the 
size  uncertainty  for  a  particular  CSCI.  Furthermore,  given  the 
degree  of  uncertainty  often  attributed  to  size  estimates,  and  the 
subjectivity  by  which  a  size  distribution  is  ascribed  to  that  CSCI, 
the  uniform  and  triangular  distribution  functions  can  be  adequately 
defined  in  order  to  bound  this  uncertainty.  One  approach  to  bounding 
this  uncertainty  is  to  have  the  size  estimator  determine,  not  an 
absolute  size  range,  but  an  80th,  or  90th  percentile  confidence 
interval  from  which  the  analyst  may  then  compute  the  absolute 
extremes  of  the  size  interval. 

The  distribution  function  for  I,  denoted  by  F^.(t),  is  a 
function  of  a  real  variable  t  such  that 

Fj(t)  =  Prob(I  <_  t)  **  /  fj(u)du 

-00 

where  f^  is  the  probability  density  function  for  size  I. 


Assuming  that  the  software  development  effort  DE  is  given  by 


DE  =  al3  =  g(I)  (2) 

where  1^0  and  o: , 3  >  0  then  g(I)  is  a  strictly  monotonic  function, 
and  g  ^(1)  exists  uniquely.  Therefore,  the  probability  distribution 
function  of  DE,  denoted  by  FDE,  is  given  in  equation  3. 

fde(x)  =  Fi^s  (3) 

This  is  true  since  g  is  a  strictly  monotonic  increasing  function, 
thus 


Fqe(x)  =  Prob(DE  x)  by  definition 


*  Prob(g(I)  <  x) 

«  Prob(I  <  g_1(x)) 

*  Fx(g  1(x>) 

where  gd)  <_  x  if  and  only  if  I  <  g  3(x).  Since  g  is  a  strictly 
monotonic  dif ferentiahle  function,  then  the  density  of  DE,  denoted 
by  fjj  is  given  below  in  equation  4. 

fDS(x)  “  fI^8  d(g  1(x))/dx  (4) 


Equation  5  defines  the  expected  value  of  the  development  effort,  and 
is  denoted  by  E(DE). 


E(DE)  =  E(g(I)) 


g(u)fT(u)du 


(5) 
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The  effort  variance  o2D£  is  defined  by 


o 


DE 


E«g(I)~  ^g(i))2) 


-  a*E  (I23)  -  (E(DE))2  (6) 


where 


E  (I23) 


f  23 

“  J u  f jCuJdu 


(7) 


The  following  subsections  describe  the  size  probability  density 
fj(t)  and  distribution  functions  Fj(t)  for  each  of  the  two  classes 
considered  in  this  paper.  Equations  for  the  effort  probability 
distribution,  the  effort  density,  the  effort  mean  and  variance  for 
each  of  the  size  probability  distributions  will  be  provided  in 
Section  2.3. 


2.2.1  The  Uniform  Distribution 


The  simplest  possible  continuous  random  variable  is  the  uniform 
random  variable.  If  size  I  is  uniformly  distributed  on  the  interval 
ta  <  t  <  tb,  that  is 


1  ~  Unif^a>  tb) 


then  the  probability  density  function  of  I,  denoted  by  fj,  is 


fx(t) 


l/(t,-t  )  if  t  <  t  <  t. 
b  a  a  t 


0  otherwise 


(8) 


The  graph  of  fx  is  shown  in  figure  1. 


Figure  1 

The  Uniform  Density  Function 


The  probability  distribution  function  of  I,  assuming  that  I  is 
uniformly  distributed  on  the  interval  ta  t  tb>  is  given  below  in 
equation  9. 

0  if  t  <  t  (9) 

a 

Fl(t)  =  (t-ta)/(tb-ta)  if  ta  <  t  <  tb 

i  1  if  t  >  tb 

2.2.2  The  Triangular  Distribution 

If  the  size  range  of  a  CSCI  can  be  expressed  by  three  points 

t  ,  t  ,  t.  where  t  is  the  lowest  possible  size  estimate,  t  is  the 
a  in  b  s. 

modal  size  estimate  (informally  referred  to  as  the  most  likely),  and 

t  is  the  highest  possible  size  estimate,  then  a  triangular  density 
b 

function  may  be  suitable  for  describing  the  size  distribution  of  the 
Computer  Software  Configuration  Item  (CSCI).  Therefore,  if  I  is 
triangularly  distributed,  that  is: 

I  ~  Trng  (ta,  t^,  tb) 
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where  t  <  t  <  t,  .  then  the  probability  density  function  of  I, 
a  —  m  —  b 

denoted  by  fj.  is 

(c/(t  -t))(t-t  )  if  t  <  t  <  t 

ID  3  3  3  m 

£,(t)  -  (10) 

l(c/(tm-tb»(t-t.b)  If  s,  <  1 

where  c  =  2 / 

The  graph  of  fj-  is  shown  in  figure  2. 


Figure  2 


The  Triangular  Density  Function 

The  probability  distribution  function  of  I,  assuming  that  I  is 
triangularly  distributed  on  the  interval  tg  <  <  t^  is  given  below 

in  equation  11. 

0  if  t  <  t 

a 

'  <  f»  <U> 

Fj( t )  =  • 

(  1+<tb-ta>'1<Vtb>"1(t-tb>2  if  '»,<'<  rb 

1  if  t  >  tb 
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2.2.3  The  Right  Triangular  Distribution 

The  Right  Triangular  Distribution  is  a  special  case  of  the 

general  triangular  distribution  described  in  the  previous 

subsection.  If  the  size  range  of  a  CSCI  can  be  characterized  by  the 

role  that  the  low  size  estimate  is  equal  to  the  modal  size  estimate 

(i.e..  t  =  t  )  and  if  there  exists  only  upward  size  uncertainty  given 
am 

by  t^,  then  it  may  be  said  that  I  has  a  right  triangular 
distribution,  that  is: 

I  ~  RtTrng  (tfl  =  tm,  tfc) 

The  probability  density  function  for  I  is  given  by 

(c/(tb-ta))(tb-t)  if  ta  <  t  <  t„  (12) 

f x( t )  = 

0  otherwise 

The  graph  of  f^  is  shown  in  figure  3.  The  probability  distribution 
function  for  I,  denoted  by  FjCt),  is  given  below  in  equation  13. 

0  if  t  <  t 

a 

FI«0  -  '  l-(tb-t)2/(tb-ta)2  if  ta  <  t  <  tb  (13) 

1  if  t  >  tb 

Since  the  uncertainty  surrounding  software  size  estimates 

almost  alwavs  exhibits  positively  skewed  distributions,  equations 

which  describe  negatively  skewed  right  triangular  size  distributions 

(i.e  t  *  tu)  have  not  been  developed  in  this  paper. 

'  ’  on  b 
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2.3  COMPUTING  THE  EFFORT  DISTRIBUTION  FROM  A  GIVEN 

SIZE  PROBABILITY  DISTRIBUTION 

This  section  provides  the  derivation  of  the  mathematical 
expressions  for  the  density  (fog),  the  distribution  (Fjjg),  the  mean 
(E(DE)),  and  the  variance  (  °Dg)  of  DE  given  that  the  distribution 
of  I  is  triangular.  The  analysis  methodology  is  the  same  if  I  is 
uniformly  or  right  triangularly  distributed.  Table  1  provides  a 
general  summary  of  the  equations  for  fDE.  fde*  E(DE),  and  o  DE  for 
the  cases  where  I  is  uniformly,  triangularly,  or  right  triangularly 
distributed. 

Given  that 

I  ~  Trng 

then  from  equation  4,  the  development  effort  density  fpg  is 
fDE(x)  -  3~1x'1(x/a)1/e  fjCg^U)) 
which  may  be  written  as 


c(tm-  t,)-1  P'1  x-Mx/«)1/P((x/a)^P-  ta)  if  ta  <  (x/a)”*  <  tm  , 

(14) 

c(tm-  tb)-i  p-i  x-i  (x/a)i'P  ((x/a)i'P  -  tb)  if  tm  <  (x/oc)’'P  <  tb 
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From  equation  3,  the  development  effort  distribution  function 
is 


0  if  (x/Ct)1^  <  ta 

(tb-ta)-1  (Via)'1  ((x/a)l;P  -  ta)2  if  taS  (x/a  )1/P  <tm 
1  +  (tb-  ta)'1  (tm-  t*)--!  ((x/a)1'?-  tb)  2  if  V*  s  (x/a)1^  <  tb 
1  if  (x/a)1^  >  tb 


From  equation  5,  the  expected  value  of  the  development  effort  E(DE) 
is 

a  c  (V  ta)-1  -  taP+2)/(p  +  2)  +  (ta^2  -  ta  tme+ ')/([}  +  1)] 

(16) 

+  a  c  (tm-  tb)-1  [(tbP>2  -  t^)/(p  +  2)  +  (tb  tmP->  -  tbP^)/(p  +  1)] 

and  the  variance  of  the  development  effort  Ogj,  based  on  equation  6 
is 

°‘DE  =  a2  E(I2P  )  -  (E(DE))2 

2  3 

where  E(r  )  is  given  by 

c  (tm*  ^[(V^2  -  ta2P-2)/(2P  +  2)  +  (ta2P-2  .  ta  tm2P-i)/(2p  +  1)] 

+  C  (tm-  tb)-1  [  (tb2P^2  .  tm2P-2)/(2(3  +  2)  +  (tb  tm2P-  •  -  tb2P+2)/(2P  +  1 )]  (  1  ?  } 

Table  1  summarizes  these  results  for  the  case  where  1  is 
triangularly  distributed.  Furthermore,  the  equations  for  fDE,  FDE, 
E(DE),  and  for  the  cases  where  1  Is  uniformly  and  right 

triangularly  distributed  are  also  provided  in  table  1. 
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RtTmg  (t,  =  t„  .  t„)  E(DE)RtTr„8  =  a  c  (V,- 1.)-1  [(t,H  •  t,,**2)/<P  +  2)  +  (t^*2  -  It  t,M)/0  +  |)]  a*  c  (tb-  t,)  ’  [(t,,2^2  -  tb  t,2M)/(2p  +  I)  +  (t.2*’2  -  tb^/GP  +  2)] 

-  (E(DE)RITrn8)2 


SECTION  3 


TRANSFORMING  THE  EFFORT  PROBABILITY 
DISTRIBUTION  TO  THE  UNIT  INTERVAL 


This  section  provides  a  method  whereby  the  direct  computation 
of  effort  at  which  there  is  a  given  level  of  risk  can  be  made. 
Consider  the  following  illustration: 

Suppose  the  development  effort  DE  at  the  80th  percentile 
level  of  a  CSCI  whose  size  is  triangularly  distributed 

ca  •=  30,  t  *  50,  and  t^  *  80  needs  to  be  computed. 
Furthermore,  let  Q:  =»  2.8  and  (3 »  1.2  as  used  in  the  COCOMO 
embedded  mode  equation  [lj.  From  equation  1  the  computed 
development  effort  DF.  evaluated  at  the  mode  size 
(informally  referred  to  as  the  most  likely)  t^  *  50  is: 

DF.  -  2.8  ( 50) 1  ‘ 2  *  306  staff  months 

Now  define  Z  as 


0  <  Z  -  Prob(DK  <  306)  »  (t^  -  t  )/(tb  -  ta)  »  0.4  <  1 

where  Z  Is  the  probability  with  which  the  computed 
development  effort  based  on  the  most  likely  size  will 
occur.  Notice  that  Z  approaches  one  as  the  most  likely 
size  approaches  the  absolute  upperbound  size  (assumed  to 
be  given  by  t^) ,  and  Z  approaches  zero  when  the  most  likely 
size  approaches  the  absolute  lower  bound  size  (assumed  to 
be  given  by  tg)  The  value  of  Z,  therefore,  describes  how 
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the  size  distribution  is  skewed.  Figure  4  shows  a  family 
of  development  effort  probability  curves  which  result  when 
Z  varies  between  zero  and  one.  Since  this  illustration 
yielded  Z  *  0.4,  selecting  the  curve  Z^  =  0.4  from  figure 
4,  the  complete  probability  distribution  of  DE  is  shown. 

The  development  effort  at  the  80th  percentile  level  along 
this  curve  occurs  when  n  (the  X  axis  in  Figure  4)  is 
approximately  0.6.  Equations  20  through  23  (to  be  derived 
in  this  section)  with  n  ■  0.6  yields  an  80th  percentile 
development  effort  DE  of  401  staff  months,  the  desired 
result. 

The  following  discussion  develops  the  mathematical  basis  for 
the  family  of  distribution  profiles  shown  in  figure  4,  and  the 
equations  which  determine  the  development  effort  DE  at  any  specified 
level  of  risk. 

Suppose  that 

t  -  Trng(ta,tm,tb, 

where  t  t  and  tu  define  the  absolute  range  of  the  triangular 
a  m  u 

probability  density  function  (pdf)  shown  in  figure  2.  The  size 
estimate  associated  with  the  mode  of  this  pdf  is  given  by  tm’  It  Is 
possible  to  transform  the  general  triangular  pdf  shown  in  figure  2 
to  the  unit  size  interval  through  the  transformation 

0  <  Z  "  (tm  "  V^b  -  ta>“1  <  1  <18> 

where  Z  is  the  exact  representation  of  tm>  but  expressed  as  a  number 
between  zero  and  one.  The  value  of  Z  reflects  the  skewness  of  the 


Figure  A 

The  Distribution  Profile  of 
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pdf.  Furthermore,  Z  is  the  probability  with  which  the  computed 
development  effort  based  on  the  mode  size  DE(tm)  will  occur,  that  is 


Prob(DE  <  DE( t  ))  =  Z 

—  m 

Given  that  I  is  mapped  into  the  unit  interval  via  the  Z 
transformation  shown  in  equation  18,  define  the  dimensionless 
function  e  by 


e 


a  (t  - 


e 


t 

a 


<  t  <  tb 


(20) 


where  the  development  effort  DE  shown  in  equation  1  is  related  to  e 
by 

DE  =  a  [  ( e/  a  )1|/&  (tfa  -  ta)  +  ta]P  ta  <  t  <  tfe  (21) 

Let 

"  »  (t  -  ta)e(tb  -  ta)"e  ta<t<tb  (22) 

from  which 

e  3  an  0<n<l  (23) 


for  some  n  bounded  by  the  unit  interval.  From  table  1  the 
development  effort  probability  distribution  F^  on  the  unit  interval 
(assuming  that  size  I  is  triangularly  distributed)  is 

0  if  (n)1/e  <  0 


Z_1(n)2/e  if  0  <  (n)l/S  <  Z 

Fdf  -  l+(Z-l)_1[(n)l/P  -l]2  if  Z  <  (n)1/P  <  1 

1  if  (n)1/P  >  1 


(24) 
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Equation  24  was  obtained  from  table  t  by  setting  ■  o,  tb  »  1, 
t  »  Z,  and  n  »  x/a  .  Table  2  provides  the  numerical  values  of 
Fpg  for  various  n. 

An  exact  value  of  the  development  effort  at  a  specified 
level  ?  (defined  by  DE y  )  can  be  obtained  by  equations  25  and  26. 
Equations  25  and  26  were  obtained  by  setting  F  equal  to  y  and 
solving  equation  15  for  x. 


DE 


(  a(V(  ^tb-ta)(tffl-ta)]l/2)3  if 

I  a(tb-[(i-M(tb-ta)(tb-tm)]1/2)P 


7  <  Z  (25) 

if  7  >  Z  (26) 
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(n)  Versus  Z  Along  the  Unit  Interval 
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Note  1:  Fni;,(n)  is  rounded  and  computed  at  P 


SECTION  4 


THE  ANALYTIC  SOFTWARE  EFFORT  PROBABILITY  (ASEP)  MODEL 


The  purpose  of  this  section  is  to  introduce  the  Analytic 
Software  Effort  Probability  (ASEP)  model.  The  ASEP  model  is  an 
extension  to  the  equations  developed  in  Section  2  for  the  purpose  of 
approximating  the  development  effort  mean,  variance,  and  probability 
distribution  for  a  system  consisting  of  two  or  more  interrelated 
Computer  Software  Configuration  Items  (CSCIs).  The  Constructive 
Cost  Model  (COCOMO)  [1]  provides  the  framework  around  which  the 
Analytic  Software  Effort  Probability  (ASEP)  model  is  developed. 

Consider  the  hypothetical  n-CSCI  system  shown  in  figure  5. 


de,  de2  de„ 


Figure  5 

An  n-CSCI  System 
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The  Central  Limit  Theorem  [3]  will  be  used  to  contend  that  the 
development  effort  distribution  is  approximately  Gaussian.  The 
Central  Limit  Theorem  states  that  if  x2,  •••,  Xn  are  independent 

random  variables  each  with  finite  mean  M ^  and  variance  ,  then  the 
distribution  of  their  sum  is  asymptotically  normal,  with  a  mean 
equal  to  0^  and  variance  equal  to^^o2  for  i  =  1,  n.  It 

would  be  convenient  to  assert  that  the  total  system  development 
effort  DE  is  approximately  normal,  with  mean  equal  to  the  sum  of  the 
mean  efforts  for  each  CSCI,  and  variance  equal  to  the  sum  of  their 
variances.  However,  the  Constructive  Cost  Model  applied  to  a  system 
of  interrelated  CSCIs  exhibits  the  property  that  the  development 

efforts  DE  DE~  ...  DE  are  not  independent  random  variables. 

1  ’  2  ’  ’  n 

Proposed  below  is  a  method  of  approximating  the  development  effort 
DE  such  that  DE  is  expressed  as  a  sum  of  independent  random 
variables,  from  which  the  Central  Limit  Theorem  is  then  valid. 

Suppose  that  the  size  uncertainty  of  each  CSCI  in  the  n-CSCI 
system  shown  in  figure  3  can  be  characterized  as  being  uniformly  or 
triangularly  distributed.  The  CQCOMG  nominal  development  effort 
relationship  is 


DE  =  a(^^Ij)^  =  1 . n)  (27) 


The  nominal  development  effort  for  each  CSCI  is  given  by 

DEi  =  (Ii)(SIi)~lDE 

r — .  -  1  r — .  3 

“  O  “  1  •••r  <28> 
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Clearly,  the  development  efforts  DE^  are  not  independent  since  a 
change  in  the  size  of  one  CSCI  affects  the  estimated  development 
efforts  of  all  the  other  CSCls  in  the  system.  It  is  possible  to 
find  a  constant  <1  which  provides  a  way  to  express  DE  as  a  sum  of 
independent  random  variables  by  solving  the  expression 

a^L^  =  cp-a^lj  U  =  1,  a)  (29) 

for  i  ,  where  is  the  mean  size  of  CSCI^. 

A 

Let  DE  represent  an  approximation  to  the  overall  development  effort 
DE  given  in  equation  27.  Thus 

DE  =  DAE  =L^DAE.  =  cp^alj  (i  =  1 . n)  (30) 


Thus,  DE  is  now  the  sum  of  independent  random  variables.  Therefore, 
the  mean  of  DE  is  given  by 


>  A 

E(DE)  =  *2-,E(DEi)  (i  =  1,  ...,  n)  (31) 


and  the  variance  of  DE  is  given  by 


2  ~  ••  V'  <■  A 

c  DE  =  '•  2^  z  DE 


(i  =  1 . n) 


where  E(DEj)  and  c‘DE-  ate  computed  for  each  CSCI  from  the 
equations  provided  in  table  1.  The  Central  Limit  Theorem  [3] 
implies  that  for  a  sufficient  number  of  CSCls  the  probability 
distribution  of  DE  is  approximately  Gaussian.  Therefore,  the 
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cumulative  probability  distribution  of  DE  may  be  determined  from  the 
statistical  tables  for  the  normal  distribution  through  the  classical 
transformation 


2  =  (DE-E(DE))/  ODE 

where  z  has  a  standard  normal  distribution.  An  example  of  how  the 
ASEP  model  is  applied  to  a  system  of  interrelated  CSCIs  is  given  in 
Case  B  of  Appendix  A. 
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APPENDIX  A 


MODEL  IMPLEMENTATION  AND  COMPARISON 
TO  OTHER  SOFTWARE  EFFORT  RISK  MODELS 

A.L  GENERAL  CASES 

The  MITRE  Cost  Technical  Center  has  recently  developed,  in 
parallel  to  this  effort,  two  additional  software  effort  risk 
analysis  models.  They  are  the  Taylor  series  approach,  and  the 
RISCOMO  tool.  The  Taylor  series  approach,  developed  by  F.D.  Powell,* 
makes  use  of  Taylor  series  in  approximating  the  mean  and  the 
variance  of  the  development  effort.  The  RISCOMO  tool  is  a 
non-analytic  model  which  employs  a  1000  sample  Monte  Carlo 
simulation  to  measure  the  impact  of  size  uncertainty  on  effort 
estimates.  The  three  cases  presented  in  this  appendix  compare  the 
performance  of  the  Taylor  series  approach,  the  RISCOMO  tool,  and  the 
Analytic  Software  Effort  Probability  (ASEP)  model.  The  Constructive 
Cost  Model  (C0C0M0)  embedded  mode  (a  =  2.8,  3  =  1.2  in  equation  1) 

was  used  to  compute  the  development  efforts  in  each  case. 

A. 1. 1  CASE  A 

This  case  examines  a  single  hypothetical  CSCI  with  the 
technical  characteristics  given  in  table  Al. 

Table  Al 

Case  A  Technical  Characteristics 


CSCI  Size  Distribution  Mean  Size 

_ KPS  I _ KDSI 

l  Trng  (30,50,80)  53.33 


Ilf  .  a 

J 


2.8 


*The  MITRE  Corporation 
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Table  A2  provides  the  development  effort  distribution,  mean, 
and  standard  deviation  which  resulted  from  implementing  the  ASEP 
model,  the  R1SC0M0  tool,  and  the  Taylor  series  approach  on  the 
technical  parameters  of  the  CSCI  given  in  table  Al. 

Since  this  case  Involved  a  single  CSCI,  the  ASEP  model  provided 
the  exact  development  effort  distribution,  mean,  and  standard 
deviation.  The  ASEP  model  results  were  obtained  from  equations  15, 
16,  and  17  (with  a  =  2.8,  t?  =  1.2)  defined  in  section  2.3.  The 
RISCOMO  tool  (based  on  1000  samples)  provided  an  approximation  to 
the  development  effort  distribution,  mean,  and  standard  deviation 
based  on  the  use  of  a  Monte  Carlo  simulation  method.  The  Taylor 
series  approach  relies  on  the  Central  Limit  Theorem  to  contend  that 
the  effort  distribution  is  Gaussian.  However,  since  the  Central 
Limit  Theorem  applies  to  systems  involving  more  than  one  CSCI,  the 
Taylor  series  approach  was  not  used  to  formulate  the  development 
effort  distribution  for  this  case,  but  only  to  approximate  the 
development  effort  mean  and  standard  deviation. 

N'ote  in  table  A2  that  the  difference  between  the  ASEP  model  and 

the  RISCOMO  tool  at  the  lOnth  percentile  (n  =  1 . 9)  did  not 

exceed  1  percent.  At  the  95th  percentile,  there  was  a  2  percent 
difference  between  ASEP  and  RISCOMO.  At  the  99th  percentile,  the 
difference  between  ASEP  and  RISCOMO  ranged  from  0.4  percent  to  5.6 
percent . 


The  RISCOMO  tool  and  the  Taylor  series  method  yielded 
approximations  to  the  development  effort  mean  and  standard  deviation 
which  evidenced  no  appreciable  difference  with  the  exact  development 
effort  mean  and  standard  deviation  generated  by  the  ASEP  model. 
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Table  A2 

Case  A  Development  Effort 
Cumulative  Probability  Distribution 


Development  Development  Effort 


Effort  Distribution 

ASEP 

RISC0M0 

Taylor  Series 

(Percentile) 

Model 

Tool 

Approach 

10 

234.2 

231.9 

20 

263.6 

268.5 

— 

30 

286.5 

288.8 

— 

40 

306.  l 

309.2 

— 

50 

325.4 

325.5 

— 

60 

347.0 

349.9 

— 

70 

371.7 

370.2 

— 

80 

401.5 

402.8 

— 

90 

440.8 

439.4 

— 

95 

468.9 

459.7 

— 

99 

506.9 

480. 1-504. 

.5 

Mean 

332.3 

334.9 

330.8 

Standard  Deviation 

76.6 

76.5 

76.5 

Note:  l.  Due  to  Its  reliance  on  the  Central  Limit  Theorem,  the 

Taylor  series  approach  was  not  used  to  formulate  the 
development  effort  distribution  in  this  case. 
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A.  1.2  CASE  8 


This  case  examines  a  2-CSCI  interrelated  system  with  the 
technical  characteristics  given  In  table  A3. 

Table  A3 

Case  B  Technical  Characteristics 


CSC!  Size  Distribution  Mean  Size  Development  Effort  Multipliers 
KDS I  KDSI  RELY  CPLX  DATA  TIME 


1  Trng  (2,5,10)  5.666  VH  H  *  * 

2  Trng  (7,9,20)  12.0  VH  *  H  H 


Note:  1.  These  are  C0C0M0  parameter  ratings  -  VH  refers  to  Very 

High,  H  refers  to  High,  and  *  refers  to  a  Nominal  rating. 


To  use  the  ASEP  model,  it  was  necessary  to  compute  DE ^  and  DE2 
based  on  the  mean  size  of  each  CSC!  given  in  table  A3.  Applying 

C0C0M0  to  the  technical  parameters  of  CSCI  1  using  the  mean  size  of 

A 

5.666  KDSI  yielded  a  development  effort  DE^  of  36.0  staff  months. 
From  equation  1  (with  a  =  2.8,  P=  1.2,  and  T  *  5.666) 


Ilf  x  =  1.604 

ai  =  2.8  (  nf^)  =  4.491 


where  Hf^ 
CSCI  1. 


is  the  product  of  the  development  effort  multipliers  for 


Similarly,  applying  the  C0C0M0  model  to  the  technical 

parameters  of  CSCI  2  using  the  mean  size  of  12  KDSI  yielded  a 

A 

development  effort  DE2  of  94.2  staff  months. 


From  equation  1  (with  a  =  2.3,  r  =  1.2,  and  T.;=  12.0),  It  may  be 
determined  that 

;:f  =  1.705 

)2 

-<2  -  2.3  (  :: f  >  =  4.775 

Let  DK^,  be  the  COCOMO-generated  total  development  effort  of  the 
system  based  on  the  mean  size  of  each  CSCI,  and  modeled  as  an 
interrelated  hierarchy.  This  was  accomplished  through  the  use  of  an 
automated  version  of  C0C0M0  which  resides  on  the  MITRE  Lfltrix 
svstem.  The  resultant  DE^,,  given  the  technical  parameters  in  table 
A3,  was  147.6  staff  months.  From  equation  29. 

147.6  =  .  (36.0  +  94.2) 

thus  -.  =  1.133.  Using  equations  31  and  32,  respectively, 

F.(l)K)  =  1  .  1  3  3  [  E  ( DE  j  )  +  E(nE7)]  =  1  .  133  (36.373  +  94.329)  =  148.7 

/2F)E  =  ( 1 .  1  33  )  ’  (  ' 2 DE  ^  +  c2DE9)  =  (1.  1  3  3 ) 2  [  158-9  +  738.124)  =  11  52.8 

where  E(DE^ ) ,  E(DE7),  c  DEj ,  DE2  were  computed  from  the 
equations  provided  in  table  l-  Therefore,  given  the  Case  B 
technical  baseline  provided  In  table  A3,  the  ASEP  model 
approximation  of  the  development  effort  mean  and  standard  deviation 
is 


E(DE)  =  148.7  staff  months 
o  DE  =  (U52.8)1,/“  =  33.9  staff  months 

Table  A4  compares  these  measures  of  central  tendency  with  the 
results  generated  by  the  RISC0M0  tool  and  the  Taylor  series 
approach . 
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Table  A4 

Case  8  Development  Effort 
Mean  and  Variance  Estimates 


Development 

Mean 

Effort 

Standard  Deviation 

ASEP  Model 

148. 7 

33.9 

RISCOMO  Toot 

147.4 

33.9 

Tavlor  Series 

148.0 

33.1 

Fach  model  evidenced  no  appreciable  difference  in  their 
approximations  to  the  development  effort  mean  and  standard 
deviation. 

The  RISCOMO  tool  should  be  used  in  this  case  to  formulate  the 
development  effort  distribution,  because  it  is  empirically  derived 
through  a  Monte  Carlo  simulation.  Since  Case  B  involves  a  system 
comprised  of  just  two  CSCIs,  the  Central  Limit  Theorem  (used  to 
contend  that  the  development  effort  is  Gaussian)  is  not  particularly 
strong  for  such  a  small  sample.  Thus,  the  ASEP  model  and  the  Taylor 
series  approach  would  not  be  appropriate  in  formulating  the  effort 
distribution  in  this  case. 

Case  C  describes  a  15-CSCI  system  where  the  effort 
distributions  formulated  by  the  ASEP  model,  the  RISCOMO  tool,  and 
the  Taylor  series  approach  may  be  compared. 


A  •  1 .  3  CASE  C 


This  case  examines  a  hypothetical  system  of  multiple  CSCIs  with 
the  technical  characteristics  given  in  table  A5. 

Table  A5 

Case  C  Technical  Characterist ics 


CSCI  Size  Distribution 
DSI 


Mean  Size 
DSI 


a 


j 


I 

RtTrng  (6437,  8883) 

7253 

2.268 

6.3  50 

2 

RtTrng  (1287,  1660) 

1412 

2.449 

6.859 

3 

RtTrng  (7750,  9222) 

8241 

2.4  74 

6.929 

4 

RtTrng  (8525,  12276) 

9775 

2.124 

5.949 

5 

RtTrng  (12231,  18347) 

14270 

2.278 

6.378 

6 

RtTrng  (8000,  14800) 

10266 

2.401 

6.723 

7 

RtTrng  (6975,  9347) 

7765 

1.406 

3.937 

8 

RtTrng  (32650,  40812) 

35371 

2.420 

6.777 

9 

RtTrng  (9000,  11250) 

9750 

1.904 

5.333 

10 

RtTrng  (3219,  4023) 

3487 

1.524 

4.268 

11 

RtTrng  (1288,  1609) 

1395 

1.515 

4.243 

12 

RtTrng  (21700,  31899) 

25099 

2.586 

7.243 

13 

RtTrng  (10075,  10881) 

10343 

2.401 

6.724 

14 

RtTrng  (64375,  67593) 

65448 

1.265 

3.543 

15 

RtTrng  (32188,  33797) 

32724 

1.265 

3.543 

Table  A6  provides  approximations  to  the  development  effort 
distribution,  mean,  and  standard  deviation  generated  by  the  ASEP 
model,  the  RISCOMO  tool,  and  the  Taylor  series  approach. 
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Table  A6 

Case  C  Development  Effort  Cumulative 
Probability  Distribution 


Development 

Effort  Distribution 
(Percentile) 

ASEP 

Model 

Development 

RISCOMO 

Tool 

Effort 

Taylor  Series 
Approach 

10 

3742 

3741 

3743 

20 

3785 

3780 

3785 

30 

3816 

3806 

3815 

40 

3842 

3833 

3841 

50 

3867 

3858 

3865 

60 

3892 

3885 

3889 

70 

3918 

3911 

3915 

80 

3949 

3938 

3945 

90 

3992 

3990 

3987 

95 

4028 

4028 

4022 

99 

4095 

4095 

4087 

Mean 

3867 

3866 

3865 

Standard  Deviation 

98 

95 

95 

Note  that  the  difference  between  the  ASEP  model,  the  RISCOMO 
tool,  and  the  Taylor  series  approach  at  the  lOntli  percentile  (n  *  1, 
...,  9.9)  does  not  exceed  0.3  percent. 
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A. 2  SUMMARY  REMARKS 


Extending  the  analytical  model  to  an  n-CSCI  system  provided  a 
method  for  approximating  E(DE),  a2 DE,  and  the  development  effort 
distribution  while  making  use  of  the  closed  form  equations  developed 
in  Section  2. 

Case  A,  where  the  system  consisted  of  a  single  CSCI,  the  ASEP 
model  provided  the  exact  values  of  E(DE),  caDE,  and  the  development 
effort  distribution.  The  RISCOMO  tool  provided  an  approximation  to 
E(DE),  c2DE,  and  the  development  effort  distribution  through  the  use 
of  its  Monte  Carlo  simulation  algorithm.  In  Case  A,  the  Taylor 
series  approach  also  provided  good  approximations  to  E(DE),  and 
c2DE,  however,  due  to  its  reliance  on  the  Central  Limit  Theorem  in 
assuming  that  the  development  effort  distribution  is  Gaussian,  a 
larger  number  of  CSCIs  (more  than  the  single  CSCI  given  in  Case  A) 
is  necessary  in  order  to  use  the  Taylor  series  approach  to  formulate 
the  development  effort  distribution  in  this  case. 

Case  B,  where  the  system  consisted  of  just  two  CSCIs,  the  ASEP 
model,  the  RISCOMO  tool,  and  the  Taylor  series  approach  yielded  no 
appreciable  difference  in  their  approximations  to  E(DE)  and  o2DE. 
Since  the  ASEP  model  applied  to  a  system  of  two  or  more  interrelated 
CSCIs  assumes  (like  the  Taylor  series  approach)  that  E(DE)  and  o2DE 
are  Gaussian,  it  should  not  be  used  to  obtain  the  development  effort 
distribution  for  Case  B.  This  case  is  an  example  of  where  the 
RISCOMO  tool  is  especially  useful.  Since  RISCOMO  is  based  on  a 
Monte  Carlo  simulation  approach,  its  development  effort  distribution 
is  empirically  derived,  hence  the  size  of  the  sample  (i.e.,  the 
number  of  interrelated  CSCIs  comprising  the  system)  does  not  affect 
the  RISCOMO  formulation  of  the  development  effort  distribution. 
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Furthermore,  in  addition  to  systems  consisting  of  just  a  few  CSCIs, 
the  RISCOMO  tool  is  useful  for  approximating  the  development  effort 
distribution  for  CSCIs  with  severely  skewed  size  distributions. 


Case  C  consisted  of  enough  CSCIs  to  show  that  the  performance 
of  the  ASEF  model,  the  RISCOMO  tool,  and  the  Taylor  series  approach 
could  be  compared.  For  the  scenario  given  in  Case  C, 
the  development  effort  distributions  formulated  by  each  model 
revealed  no  appreciable  difference. 

To  summarize,  the  ASEP  model  has  the  feature  of  producing  exact 
values  of  E(DE),  c2DE,  and  the  development  effort  distribution  given 
tile  case  of  a  single  CSCI.  For  systems  consisting  of  many  CSCIs,  the 
ASEP  model  produces  approximations  to  these  measures.  Its  analytic 
nature  makes  the  ASEP  model  computationally  easy  and  inexpensive  to 
execute  on  a  computer. 

The  Taylor  series  approach  is  similar  to  the  ASEP  model  in  the 
sense  that  it  is  an  analytic  model.  However,  unlike  the  ASEP  model, 
the  Taylor  series  approach  cannot  be  used  to  formulate  the  develop¬ 
ment  effort  distribution  of  a  single  CSCI  (as  in  Case  A)  due  to  its 
reliance  on  the  Central  Limit  Theorem.  By  reason  of  its  analytic 
nature,  the  Taylor  series  approach  also  has  the  feature  of  being 
computationally  inexpensive  in  computer  usage. 

The  RISCOMO  tool  can  be  applied  to  a  broad  set  of  scenarios, 
however,  it  is  not  computationally  straightforward,  and  can  become 
expensive  in  computer  usage  due  to  its  Monte  Carlo  simulation 
nature.  RISCOMO  is  especially  useful  in  cases  Involving  a  few 
CSCIs  and  in  circumstances  where  the  size  distributions  of  a  system 
of  CSCIs  are  severe ly  skewed. 
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